Optimal positive-operator-valued measures for unambiguous state discrimination 
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Optimization of the mean efficiency for unambiguous (or error free) discrimination among N 
given linearly independent nonorthogonal states should be realized in a way to keep the probabilistic 
quantum mechanical interpretation. This imposes a condition on a certain matrix to be positive 
semidefinite. We reformulated this condition in such a way that the conditioned optimization 
problem for the mean efficiency was reduced to finding an unconditioned maximum of a function 
defined on a unit iV-sphere for equiprobable states and on an A-ellipsoid if the states are given 
with different probabilities. We established that for equiprobable states a point on the sphere with 
equal values of Cartesian coordinates, which we call symmetric point, plays a special role. Sufficient 
conditions for a vector set are formulated for which the mean efficiency for equiprobable states 
takes its maximal value at the symmetric point. This set, in particular, includes previously studied 
symmetric states. A subset of symmetric states, for which the optimal measurement corresponds to 
a POVM requiring a one-dimensional ancilla space is constructed. We presented our constructions of 
a POVM suitable for the ancilla space dimension varying from f till N and the Neumark's extension 
differing from the existing schemes by the property that it is straightforwardly applicable to the 
case when it is desirable to present the whole space system + ancilla as the tensor product of a 
two-dimensional ancilla space and the A-dimensional system space. 

PACS numbers: 



I. INTRODUCTION 

Being important for quantum communication, quan- 
tum cryptography and quantum algorithms (see e.g. [l[) 
the problem of fincling an optimal strategy for discrimi- 
nating among different non-orthogonal quantum states 
has gained renewed interest 0, y|, BL S B 0, HI (for 
reviews see 0, Between different strategies based 
on various criteria (see e. g. [|| ) a special role is 
played by the one called unambiguous state discrimina- 
tion @, 0, 0, H, IE 0> HI ■ It is based on pioneering works of 
Ivanovic, Dieks and Peres Q who considered the problem 
of discriminating among two non-orthogonal states. In 
9] Peres and Terno examined the case of three states. In 
7| Chefles showed that non-orthogonal states may be un- 
ambiguously discriminated if and only if they are linearly 
independent and proved an important fact that the nec- 
essary (but not sufficient) condition for a measurement 
to be optimal consists in equating to unity the highest 
eigenvalue of a certain matrix. Chefles and Barnett @ 
using this property derived an upper bound (achievable 
by an optimal measurement) for success probability to 
discriminate among N so called symmetric states given 
with equal a priori probabilities. 

Another important general result was established by 
Duan and Guo [9( who derived a matrix inequality for ef- 
ficiencies to unambiguously discriminate among N given 
linearly independent states. This result permitted to Sun 
et al [1CJ to reformulate the problem of finding the opti- 
mal discriminating strategy as the well-known semidefi- 
nite (conditioned) programming problem. Although this 
fact together with the remark that there exist softwares 
for numerical solution of the latter problem [l^, [HI re- 
duces any particular optimization task to a technical 
problem, this approach, as well as any numerical solution, 



usually does not bring any additional analytic insight. 

In the current paper we show that the conditioned op- 
timization problem for the mean efficiency may be refor- 
mulated as a problem of finding an unconditioned maxi- 
mum of a function defined on an Af-sphere for equiproba- 
ble states and on an A^-ellipsoid when the states are given 
with different probabilities (Theorem [T] and Corollary [TJ. 
We establish that for equiprobable states there exists a 
special point on the unit A^-sphere called a symmetric 
point where the mean efficiency takes its maximal value 
for certain sets of states. In particular, this set includes 
all symmetric state. It happens that for AT = 2,3 this 
point is optimal only for symmetric states. Nevertheless, 
as we will show, starting from A~ = 4 the set of states, 
for which the symmetric point is optimal, is wider than 
the set of symmetric states. Using Theorem [T] we formu- 
lated conditions on the set of states which are sufficient 
for the mean efficiency to be maximal at the symmetric 
point (Theorem [2]) . For N = 4 Theorem [2] is illustrated 
by an example of a three parameter set of states which 
are non-symmetric and such that one of the parameters 
is just the optimal mean efficiency. 

We show that if one of the eigenvalues of matrix tyty* , 
with '5 being an N x N matrix where the given vectors 
are collected as columns, is N — 1 fold degenerate, the 
set is either a particular set of symmetric states or their 
unitary rotated form. This set may be considered as the 
simplest generalization of two states since the expression 
for the optimal mean efficiency for two states is a particu- 
lar case of a more general formula obtained for the above 
set. Moreover, if this is the lowest eigenvalue, which is 
N — 1 fold degenerate, the ancilla space for the optimal 
measurement becomes one-dimensional. 

For a general set of symmetric states we obtain for 
the optimal mean efficiency an expression alternative but 



2 



equivalent to that found by Chefies and Barnett [8(. 

Finally, we present our constructions of a POVM suit- 
able for the ancilla space dimension varying from 1 till 
N and the Neumark's extension differing from the exist- 
ing schemes by the property that it is straightforwardly 
applicable to the case when it is desirable to present the 
whole space system + ancilla as the tensor product of 
a two-dimensional ancilla space and the iV-dimensional 
system space. 

II. OPTIMAL POVM 

Assume that we are given a set of non-orthogonal but 
linearly independent and normalized to unity vectors 
(states) ipj, (tpj\ipj) — 1, j = 1,...,N, which we col- 
lect as columns to an N x N matrix iff = (-01, . . . , i/jn)- 
Because of the linear independence of the vectors ipj the 
matrix '5 is non-singular, deff ^ 0, which is the prop- 
erty we will assume to hold throughout the paper. Below 
we will frequently use the following terminology. Instead 
of saying the vectors (states) tpi, . . . , ipN we will say the 
vectors (states) VP. We will assume also that these vec- 
tors span an N dimensional complex Hilbert space 7i 
with the usually defined inner product (-|-). Evidently, 
they form a non-orthogonal basis in Ti. As it was pre- 
viously shown @, i, 1, i, i, 0, 1 unambiguous (or error 
free) strategy for discriminating among these states, is 
possible if and only if they are discriminated by a gen- 
eralized measurement [2l[ (for reviews see e.g. [l2| )■ To 
construct a positive operator valued measure (POVM) 
to unambiguously discriminate among the states \E' it is 
convenient to introduce in TL the basis £fc biorthogonal 
to ipj (Chefies and Barnett § call reciprocal states) 
defined as (£k\' l Pj) = $kj- Evidently, being collected as 
columns to matrix 5 = . . , £jv) they may be found 
as S = (^t) -1 where ^ is a matrix Hermitian conjugate 
to fy. Note that multiplication of S from the right by a 
diagonal matrix 

X = diag(xi, . . . ,x N ) (1) 
corresponds to changing the norms of the vectors £j, 

Z x = EX = (tf)- 1 X, (2) 

leaving unchanged the orthogonality between and ipj 
for j 7^ k 

0>Mxk) =Xj5 jk . (3) 

By this reason without losing generality one may always 
assume Xj > 0, j = 1, . . . , N, the property we assume to 
hold throughout the paper. We shall call X the weight 
matrix and the Xj weights. 

Assume now we can associate detection operators with 
rEjj = I^iejKCxjI- Note that the matrix 

N 

= 53 l&i) (^1=3,4, (4) 



which is the left modulus of E x (see [l3[ for the defini- 
tion), is Hermitian and positive definite. Then because 
of orthogonality ([3]) detector ff x k clicks if and only if 
the state tpk is captured. This confirms that we can un- 
ambiguously discriminate ipk from all other states 
j k. The probability P/. for this detector to click 
is Pk — (ipkl^-xkl^k) which in view of ([3]) reduces to 

/' 4- 

This probabilistic interpretation is valid if and only if 
there exists a Hermitian positive definite (or semidefinite) 
matrix H x such that 

n, + u x = i N (5) 

where In is the N x N identity matrix. Relation ^ 
defines a (discrete in the current case) POVM which is 
the cornerstone of the probabilistic interpretation. From 
here it follows that tl x — 1^ ~ H x — In — £ x Sj. should 
be positive definite (or semidefinite). Note that this 
matrix is positive definite (semidefinite) together with 
In — S^S-e = In — X( , i'^)^ 1 X which, in turn, is posi- 
tive definite (semidefinite) together with $t^r_ x 2 . The 
last condition is just the same necessary and sufficient 
condition for an optimal POVM to exist first found by 
Duan and Guo [9(. We, thus, identify the square of our 
weight matrix X 2 with the efficiency matrix T used by 
Duan and Guo. 

Note that II X = In — H x is positive semidefinite if the 
highest eigenvalue of is equal to I. Thus, a POVM 
can always be constructed if X = X^In, (A m > 0) 
where A~ 2 is the highest eigenvalue of SH^ and X 2 n is 
the lowest eigenvalue of both the right modulus of 'J, 
= (StS) -1 , and its left modulus . Another 
important property we would like to emphasize is the 
following. When the mean efficiency takes its maximal 
value, the highest eigenvalue of H x is necessarily equal 
to 1 Q. Therefore, is always positive semidefinite if 
the POVM is optimal. As we will show below in a num- 
ber of cases this takes place for X = A„ Jjy. Moreover, 
rankfla; = N - N m := N a where N m = 1, . . . , N - 1 is 
the degree of degeneracy of the highest eigenvalue of H x 
(for X = A" 1 In this is also the degree of degeneracy of 
A m 2 ) so that N a = 1, . . . ,N - 1. 

If the state ifjj can be chosen from the set ^ with a 
probability r\j , the total probability of correctly identify- 
ing any state 

N N 

Y.-iji'j X '/,'•; (6) 

is called the mean efficiency. The optimization problem 
becomes now a semidefinite programming problem (see 
[Iol |) consisting in finding the maximum of P(x) with 
respect to X > subject to either _ x 2 > or 
equivalently 

I N -Z x Zt -/Ar-OJt)-!^- 1 >0. (7) 
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We shall denote this maximum by Pm- In contrast to 
approach proposed in [l(| the constraint ([7]), we are us- 
ing, permits us to reduce the conditioned optimization 
problem to finding the minimal value of a function de- 
fined on an iV-ellipsoid. For brevity in what follows we 
shall denote by x — (xi, . . . ,xn) Cartesian coordinates 
of a point from M. N and by t = (ti, . . . , ijv-i) curvilinear 
coordinates on either a unit TV-sphere or an iV-ellipsoid 
with fixed axes. 

Theorem 1. Let us denote 

N 

Pm = max /J%' a; j (8) 

(Jn-H.H^O)-^ 1 

and 

MM = minMm(*) ( 9 ) 

where /U m (i) is the highest eigenvalue of the matrix 
SY" 2 (i)E^ with Y(t) being a diagonal matrix with the non- 
zero entries equal coordinates of a point lying on the posi- 
tive part of an N -ellipsoid with the axes n 1 1 ^ 2 , . . . , rjpf 1 ^ 2 . 
We denote f2 this part of the ellipsoid, i. e. 





Y(t)=dmg( yi (t),.. 


■,VN(t)) , 




-1/2 .... 

= i] 1 sin ii sini2 . 


. . sintjv-2 sin<7v_ : 


2/2 (t) 


-1/2 .... 

= r] 2 sin ii sini2 . 


. . sintjv„2 costAr_ 


2/3 (*) 


-1/2 .... 

= 773 sinii sint2 . 


. . sintjv-3 costAr_ 


VN-l(t) 


-1/2 . , , 

= sinti costs 


(for N > 2) 


VN(t) 


-1/2 , 
= COStl 





(10) 



<t u ...,t N <tt/2. (12) 

T/ien 

p M = Ml/ ■ ( 13 ) 

Proof. First we note that 2j=i VjVj = 1- 

Let us put in © X = rF(<), i.e. = r Vj{t)i 

j = 1,...,N. This yields P(x) = r 2 and SA 2 ~t = 
r 2 SY" 2 (t)St. Since r 2 is simply a scaling factor between 
eigenvalues of SA" 2 St and SF 2 (t)S'l' this implies 

tf n =r- 2 j? m =P- 1 (x)ji 2 m (14) 

where /I^ is the highest eigenvalue of ZX 2 ^ . We note 
that the left hand side of Eq. ([14]) is a function de- 
fined on the ellipsoid, fi m = fj, m (t). Next, we shall use 
the property proven in Q: The necessary condition for 
the efficiency to be optimal is the highest eigenvalue of 
EX 2 Ev being equal to 1. (Under this condition the con- 
straint In — S x St > is automatically satisfied.) Let 
us put in (TT4"1) jj,^ = 1 Vt € Q. Under this condition 
Eq. (114|) defines parameter r and with it the mean 



efficiency P(x) as functions on the ellipsoid only, i.e. 
P = P(t) = r 2 (t) = fJ. m 2 (t). Thus, the maximal value 
of P{t) corresponds to the minimal value of n 2 yl (t) on the 
part of the ellipsoid with all yi{t) > 0. □ 

For equiprobable states, when rji = . . . = tjn = 1/N, 
we find it convenient to re-scale all yj (t) and use yj (t) = 
^/~Nyj{t) so that the point (j/i(t), • ■ • , VN(t)) lays on a 
unit iV-sphere. To simplify notations below in this case 
we will use just the unit sphere and omit tilde over yj(t). 
This leads to the following modification of Theorem [1] 

Corollary 1. For equiprobable i.e. for r\\ — r]2 — 
... '/ \ A '. 

Pm = N~ 1 max ^(t) (15) 

where ft is a part of the unit N -sphere defined by ill]) 
and \12\) with r\\ — ... = t]n = 1, ^ 2 m (t) is the highest 
eigenvalue of the matrix SY 2 (i)E^ , and Y(t) is given in 

We would like to stress that the above condition for the 
highest eigenvalue of EX 2 E^ be equal to 1 proven in Q is 
necessary but not sufficient for the efficiency to take the 
maximal value. As it is clear from the proof of Theorem 
[TJ for P(t) = /J.^(t) this necessary condition is satisfied 
Vt £ H and, in general, P(t) is not constant for t € CI, 
which can be easily seen from the simplest 2x2 example. 
From ([14]) it follows also that, in general, P(x) = Jl^fj,^ 
depends on both a point t on the ellipsoid and parameter 
r. 

On the ellipsoid there always exists a special point to 
where y x (to) = y% (to) = ... = yjv(*o). We will call 
this point the symmetric point. At this point matrix 
Y 2 (t)\ t =t becomes proportional to the identity matrix 
and EX 2 ^ is proportional to SST. By this reason the 
eigenvalues of 3X 2 ST are, simply, the eigenvalues of SEv 
scaled by a factor r 2 yf(t ). In particular, 

Mm = r 2 2/i(*o>m. (16) 

where is the highest eigenvalue of HHt which is given 
through the matrix "J since 5 = ( 1 i'^)~ 1 . If now we re- 
quire for the highest eigenvalue of EX 2 ^ be equal to 1, 
Jl 2 n = 1, Eq. (TIB)) defines r 2 as a function of to- Therefore 
from Eqs. (|14[) and (TIB)) we obtain the mean efficiency 
P(to) = r2 (to) = yi 2 (to) v m 2 - F° r equiprobable states, 
as we show below, in a number of cases point to is just 
the maximal point for the mean efficiency. In particular, 
this point is optimal for any symmetric set of equiproba- 
ble states but not only for these states. The set of states 
for which the mean efficiency takes its maximal value at 
t = to is wider than the set of symmetric states. How- 
ever, as our numerical tests show, in many cases there 
exist points on the ellipsoid where the efficiency takes 
higher values. We believe that the symmetric point may 
be a good starting point for a numerical optimization. 

Theorem [T] and Corollary [T] give a general recipe for 
finding the optimal weight matrix X(t) and with it the 
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optimal POVM defined by the set E x = EX(t) to un- 
ambiguously discriminate among the vectors collected to 
matrix if?. In general the matrix 'J is non-Hermitian and 
can have complex eigenvalues. Nevertheless, as we show 
below, for any given if? there always exists a positive def- 
inite \I/ 1 = which has the same optimal weight matrix 
and the same optimal mean efficiency as if 1 . 

Corollary 2. Both the optimal weight matrix X = Xm 
(QJ) and the optimal mean efficiency Pm ire invariant un- 
der a unitary rotation of the vector set if? . In particular, 
for any non-singular if> given in the form 

* = U 2 E U X , U$ = U{ 2 , (17) 
E = diag(E ,i, E , N ) , E 0tj > 0,j = 1, . . . , A(18) 

the optimization may be realized for = U\EqUi. 

Proof. First wc note that decomposition (fl~?| . (fTS)) takes 
place for any non-singular matrix if? (see e.g. [131]). De- 
note t = tjrf the point on the ellipsoid where the highest 
eigenvalue fJt%,(t) of the matrix 'EY 2 (t)'E^ takes its min- 
imal value as a function of t. Next, since S = (\E'T)- 1 ) 
this implies EY 2 (t)E^ = U a E^ 1 U 1 Y 2 (t)V^Eo 1 U^. From 
here it follows that the eigenvalues of matrix SF 2 (tjv/)S'l' 
do not depend on unitary matrix {/ 2 . In particular, the 
eigenvalue — fi^tu) ® and with it, as it follows 
from (|13|> . both the optimal mean efficiency Pm and the 
weight matrix Xm = V PmY^m) are invariant under a 
unitary rotation of the vectors if?. Therefore the opti- 
mization may be realized for the set \&i = u\EoUi. □ 

Note that in general properties of the vectors collected 
to a matrix unitarily equivalent to a given matrix change 
essentially. In particular, if the vectors if? are normalized 
to unity, the vectors U^U\ in general, are not normal- 
ized to unity. Nevertheless, there exist particular unitary 
transformations keeping unchanged both the mean effi- 
ciency and the weight matrix. These are transformations 
from an Abelian subgroup of the unitary group. They 
have the form U = U a = diag(e 4/31 , . . . ,e J/3 ~), /3j £ M. 
Indeed, being multiplied from the right by U a every col- 
umn vector of \f r acquires an additional inessential phase 
factor whereas after the left multiplication the vectors if? 
are simply unitary rotated. Thus, such a transformation 
corresponds to a re-scaling of the vector set if' by phase 
factors followed by their unitary rotation. According to 
Corollary [2] these operations can affect neither the mean 
efficiency nor the weight matrix. 

The next theorem establishes sufficient conditions for 
the symmetric point on the A-sphere to be a point of 
the maximal mean efficiency for equiprobable states. To 
prove this result, according to Corollary [T] one has to 
find the minimum of the highest eigenvalue of the matrix 
SY" 2 (i)S^ for t on the unit A-sphere. 

Theorem 2. Let if? be Hermitian, positive definite and 
the states if? are equiprobable. If there exists an eigenvec- 
tor<p° m = Wv>°J = 1, of if? corre- 

sponding to its lowest eigenvalue X m , i.e. if?f j = Xjfp 



X m = min^Xj, such that |¥>m,il = \v>m,2\ = ■■■ = 
Kwl=W~ 1/2 i thenP M = \ 2 m . 

Proof. First we note that because of Hermiticity of if?, 
matrices \E"I ft = vfrtijr = vfr 2 , S = if- 1 an d E^E = 
EST = S 2 may have the same set of eigenvectors. In 
particular, ~Vm = A"Vm and S3 Vm = so 
that A~ 2 is the highest eigenvalue of SSL 

Let us denote H(t) = SF 2 (t)St where Y(t) is given m 
(fTU| with yj(t) being given in (fTTj) and (fl~2)l with r]i = 
. . . = ijn = 1 so that YljUjit) = 1- Let also /i^(t) be 
the highest eigenvalue of H(t) and f m {t) be one of the 
eigenvectors with eigenvalue /^(t). i.e. 

H(t)i Pm (t)=fi 2 m (t) ( p m (t). (19) 

Since H 2 n {t) is the highest eigenvalue then according to 
the variational principle the following inequality \j? m (t) > 
(ip\H(t)\ip) holds true VV S Ti.. In particular, for any t 
we have 

&(t) > {<p° m \H(t)\<p° m ) = ( V m \EY\t)E^° m ) 

N 

= A m 2 <^„|y 2 (t)|^ l )=A m 2 ^|< l | 2 2/ 2 = A- 1 A m 2 .(20) 

i=l 

Let to be a point on the unit sphere where y% (to) — 
•■■ = VN(h) = A" 1 / 2 and, hence, Y(t) = A" 1 / 2 ^. 
This implies that ip m (to) — <jP m and /^(to) = 
(<p m \H(to)\<p° n ) = A-^J-St^) = A^A- 2 . Com- 
paring this result with (|20|) we conclude that /z m 2 (io) > 
^i~ 2 (£) for any t and, hence, according to Corollary Q] 
P M = [NfiKtoT 1 = A 2 „. □ 

This theorem has an interesting implication which we 
formulate as the next theorem. 

Theorem 3. Let matrix if> be such that matrix if?if?^ 
has only two distinct eigenvalues A 2 and X 2 ^ and one of 
them is A— 1 fold degenerate, then for equiprobable states 
P M = mm(Af, X 2 N ). 

We would like to remind that the positive square roots 
Xj of the eigenvalues of the left modulus of if 1 , i.e. of 
the eigenvalues A 2 of if?if?\ are called singular numbers 
of if? [l3| . Thus, by the theorem assumption one of these 
numbers is A — 1 fold degenerate. 

Proof. Let for definiteness X 2 be A — 1 fold degener- 
ate. Using Corollary [5] without losing generality one 
may assume if? to be Hermitian and positive definite, 
* = WE U, W = U- 1 = (u kij ), Eo = diag^oj), 
E ,j = Ai > 0, j = 1,...,A~- 1, E ,n = X N > 0, 
so that matrix ^if? = = if? 2 has eigenvectors 

U'. Thus the first A — 1 vectors Uk = (u\,u, ■ ■ ■ ,UN,kY, 
k = 1, . . . , A — 1 collected as columns to W correspond 
to the eigenvalue Ai of if> and the last column of U\ 
u N = (ui : n, ■ ■ ■ : u n,nY, corresponds to E 0:N = X N . 
Moreover, since any unitary transformation ^ — > JJif?, 
U^ 1 = U' does not affect the norms of the vectors, 
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diagonal entries of \& 2 = \f f t\I f remain equal to 1 af- 
ter any unitary rotation. (Note nevertheless that the 
whole overlap matrix = * 2 remain unchanged af- 
ter such a transformation.) Below we prove that un- 
der these conditions the eigenvector of ^ 2 corre- 
sponding to A 2 ^ has coordinates with equal absolute val- 
ues. Moreover, in the eigensubspace of 'I' 2 correspond- 
ing to eigenvalue A 2 there always exists an eigenvec- 
tor U N -i = [u lt N-l,---,UN,N-l) t , (ujV-l|«N-l) = 1, 

^itjv-i = AiWiv-i, such that |ui,jv-i| = |i*2,jv— ij = 
• ■ ■ = |t*jv,jv— 1| = N^ 1 / 2 . Then the statement will follow 
from Theorem [2] 

Condition ("$> 2 )k,k = 1 or explicitly 

N-l 

^iJ2 KjI 2 + 1^1^ = 1, k=l,...,N (21) 

3=1 

and unitarity of U, Y^j^i \ u k,j\ 2 + |«fc,jv| 2 = 1, imply 

K,;v| 2 = X 2~ A L k=l,...,N. (22) 
Using this result and unitarity of U once again one gets 

UfciJV = -L e fa *.* , fc = i J ...,JV J (23) 
v iv 

where real phases a^jv are defined by a concrete choice 
of*. 

From (f2"2")l and (|2"3"|) we find a link between eigenvalues 
A 2 and A 2 ^ 

A^ + (iV-l)A 2 = iV, (24) 

which results in a restriction for the lowest eigenvalue 
< min(A 2 ,A^) < 1. 

Since by assumption the first N — 1 vectors collected 
to TP correspond to the same eigenvalue of 4" there 
is a freedom to choose in the eigensubspace spanned 
by these vectors any orthonormal basis. For our pur- 
pose it is sufficient to show that there exist real phases 
&k,N-i> k = 1, ...,iV such that a vector un-i = 
(ui,iv-i) • • • j u N,N-l) , 

Ufc.jv-1 = -4= ^"• N - 1 , fc=l,...,JV (25) 

V JV 

is orthogonal to the vector Uk,N p3]l. Indeed, from [[23]) 
and (|25|) one obtains 

AT 

^ e <(«*,jv-«*,w-i) = o . (26) 
fc=i 

This equation means that a k ,N — &k,N—ii k = 1, . . . ,N 
is (k — l)st root of unity, i.e. a kt N — a fc,iV-i = ^-{k — 1) 
so that 

afe,iv-i = Qife.iv - -jr(k - 1) , fc = l, ...,N. (27) 

We, thus, explicitly constructed the vector u^-i from the 
eigensubspace corresponding to the eigenvalue Ai having 
the necessary form. □ 



From the proof of this theorem one can extract a 
particular representation of a vector set * when W 
is Hermitian, positive definite and has only two dis- 
tinct eigenvalues. First we note that using proper re- 
scaling and unitary rotation one can always guaran- 
tee equal values of all coordinates of the vector un, 
i.e. un = (1, ■ ■ ■ , lY/y/N. By this reason the vectors 
Uj = (exp(iaij), . . . ,exp(iaAr J ))*/v / ^V, j = 1, ...,N— 1 
with a k , 3 -i = a k ,j - 2n(k ~ l)/N, k = 1, . . . , N, j = 
N, N—l, . . . , 2 are orthogonal to un since they differ from 
each other only by permutations of coordinates. More- 
over, it is straightforward to check that together with un 
they form an orthonormal set in TIn- (They correspond 
to all N roots of unity.) Using property ((26)) of roots of 
unity one can find the entries ipj tk of * = WEqII 

iP j , k = (\ N -\i)/N, jjtk, (28) 
i> k , k = [\ N + (N-l)\i]/N, j,k = l,...,N (29) 

where Ai and Ajv should satisfy Eq. (|24|) . 

Note that the inner products of the vectors * pgj). (125)) 
are equal to the same value (tpi\ipj) — s = (\ N — A 2 )/-/V, 
* 3i \ s \ < 1- Using Eq. ([24]) one can express eigenvalues 
of * 2 in terms of s as \\ — 1 — s, \ 2 N = \ 2 +Ns. In turn, 
the optimal mean efficiency in terms of s reads 

P M = 1-s + N(s- \s\)/2. (30) 

Our last comment here is that N a = rank IT, = 1 for 
s > and N a = N - 1 for s < 0. 

III. PARTICULAR CASES 

In this section we show that for N = 2 and N = 3 
the maximal mean efficiency obtained from Theorem [5] 
corresponds to a set of symmetric equiprobable states 
and this theorem may lead to a wider set of equiprobable 
states starting from N = 4. In Section IIII AI we illus- 
trate our theorems for the case of two non-orthogonal 
states. In particular, if the states are given with different 
prior probabilities we specify a domain in the parameter 
space when Theorem [1] results in a von Neumann mea- 
surement. In the sections IIII Bl IIII CI and IIII Dl we con- 
sider equiprobable states only. In Section IIIIBI we con- 
struct a four-parameter set of three vectors normalized 
to unity and described by a Hermitian positive definite 
3x3 matrix and present an example of states for which 
the symmetric point is not optimal. In Section IlII CI we 
construct a three parameter set of four normalized to 
unity vectors described by a 4 x 4 Hermitian matrix with 
the optimal point being the symmetric point. In Section 
IIII D I we show that for any set of N symmetric states the 
symmetric point is the maximal point for the mean effi- 
ciency and re-derive the result obtained by Chefles and 
Barnett Q for the value of the maximal mean efficiency. 
We obtain also an alternative formula for the maximal 
mean efficiency which may be useful when the states are 
given in a form different from that used by Chefles and 
Barnett 0. 
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N = 2 



value of EY 2 (t)E 



Although the case of two states is well studied (see e.g. 
0, [H, H, [T(j ) we find it useful for illustrating our theorems 
above. 

The most general Hcrmitian 2x2 matrix '5 composed 
of the vectors normalized to unity has the form 



* = 



Vi^T 2 

e ia r 



VT^T 2 



i 



. 0<r< 7f- ( 31 ) 



The inequality in (|3 1 1) guarantees the positivity of eigen- 
values of 'J, Ai : 2 = VT — f 2 =F r. 

First we note that using a matrix U a = 
diag (1, exp(— i/3)) for rescaling and unitarily rotat- 
ing the set (pJTj) , i.e '5 — > U a ^Ul, we can eliminate the 
inessential phase factors in the off-diagonal entries of 
"J. Therefore in (|3"Tj) we can put a = 0. Now we see 
that if for matrix (f3Tj) with a = we displace the first 
coordinate of the vector ipi to the next position and 
put its last (second in the current case) coordinate at 
the first place, we obtain ?/; 2 . The same transformation 
applied to tp2 gives ip%. Since this transformation is 
unitary, the states (f3"Tj) are symmetric (see Q for a 
definition). 

For equiprobable states using Y(t) — diag(sin£, cost) 
we obtain the highest eigenvalue of EY 2 (t)E = 

t-iy 2 ^)*- 1 



2 - V2 [1 + 4r 2 - 4r 4 + (1 - 2r 



2\2 , 



3 (4t)] 



1/2 



4(1 -2r 2 ) 2 



(32) 

which has a minimum (/i TO 2 (£) has a maximum) at £ = 
7r/4. Note that this is just the symmetric point on the 
unit circle and, hence, for N = 2 the mean efficiency ac- 
quires the maximal value at the symmetric point. Thus, 
using Corollary[T]one obtains the optimal mean efficiency 
P M = 1 - 2rVT _ 7^= 1 - |<Vi|-02>|- 

The coordinates of eigenvectors of 'J', «i :2 = (Tl + 
e m )*/\/2 have equal absolute values. Therefore according 
to Theorem^ P M = min(A 2 , A|) = (a/I -r 2 -r) 2 which 
is just the same as above. 

For N — 2 the degree of degeneracy of the lowest eigen- 
value assumed in Theorem [3] is N — 1 = 1. Thus using 
Eq. (|3T)|) for N = 2 one obtains the same value once 
again Pm = 1 — |s| = 1 — | (Vi I Utlie result previously 
reported by numerous authors @, [H, H, EH • This means 
that Eq. (f3TJ)) presents the simplest generalization from 
iV = 2 to an arbitrary iV. 

For states given with different probabilities 771 and r\i = 
1 — 771, 771 7^ 1/2 it is instructive to illustrate the case 
when TheoremQ]leads to a projective (i.e. von Neumann) 
measurement. 

First we note that without losing generality one may 
assume 1/2 < r]\ < 1. Then choosing Y(t) = 



diag(r; 1 ^ 2 sinf, rj 2 1/z cost) we find the highest eigen- 



-1/2 



cos(2i)(27?i - 1) - 2G(t) 



4?7i»7 2 (l - 2r 



2^2 



(33) 



where 



G(t) = [| + (m - |) cos(2<)] 2 - 77^2(2^ - l) 2 sin 2 (2i) . 

Function (|33[) may have minima only at points where 
d^ji 2 n {t) / dt — 0, i.e. at points satisfying the equation 



sin(2t)Gi(i) = 



(35) 



where 



2r\ x - 1 + (1 - 16r 2 )(l - r 2 )^^ 



G 1 (i)=4j 7l -2 + 

(36) 

In particular this may happen at t — and t — tt/2 where 
the function /i^,(t) takes the values 



/ i 2 „(0) = (l-2r 2 ) 



2\-2 -1 



/ z 2 ra (f) = (l-2r 2 )-V 1 <^(0). 



(37) 



Note that at these points either x\ = or X2 = mean- 
ing that either detection operator II^i or EL^ does not 
participate at the optimal measurement and can be omit- 
ted from POVM so that the resolution of the identity ([5]) 
contains in this case two terms only. For one-dimensional 
operators and N — 2 this necessarily leads to orthopro- 
jectors as detection operators and, hence, to a von Neu- 
mann measurement. 

According to ((33)) and (|34j) we have /x 2 ^ (t) = ^ (vr — t) 
and therefore t = ^ cannot be an inflexion point for this 
function. Moreover there exists no more than one point 
t G (0,tt/2) where d^ m (t)/dt := (d^ 2 m {t) / dt)\ t=i = 0. 
This follows from the fact that this point should be found 
from the equation G\(t) — with Gi(t) given in |36|) 
where, according to (f34|) . G(i) is a second order polyno- 
mial on cos(2t). By this reason to specify a domain in 
the space of parameters (r, rji ) where the optimal mea- 
surement is a von Neumann measurement it is sufficient 
to find the values of (r,r)i) such that t = ^ is just the 
point of a minimum for /^(t), i.e. the point where 
d 2 n 2 m (Tr/2)/dt 2 > 0. Thus from Eqs. J3SJ) and §M§ we 
find the condition 



2 + 2(4r 4 - 4r 2 - 1)771 > 



or equivalently 



1 + 4r 2 - 4r 4 



< m < 1 



(38) 



(39) 



From Eq. I|37p we find the optimal mean efficiency for 
this measurement Pm = »7i(l — 2r 2 ) 2 = ?7i(l — (V'i|'i/'2) 2 ) 



2fir 



which is just the "von Neumann value" 771 cos ( 
r]i sin 2 9 where is the angle between the vectors ip\ and 

■02- 

Below we will consider equiprobable states only. 
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B. TV = 3 

The most general Hermitian 3x3 matrix if? = 
(jpi i ip2> ^3)1 contains three real and three complex pa- 
rameters. Since the vectors ipj are assumed to be nor- 
malized to unity, the diagonal entries of \& can always 
be expressed in term of off-diagonal elements. Moreover, 
one can always re-scale the vectors ip2 and tps such that 
their first components become real. After re-scaling the 
Hermiticity of $ can be restored by a proper unitary ro- 
tation of the vectors . Thus, in the current case without 
losing generality matrices \& may be assumed to form a 
four-parameter family with a real parametrization. In 
general, eigenvalues of such a matrix should be found 
from a third order algebraic equation. Although any 
third order algebraic equation can be solved analytically, 
solutions, in general, are rather complicated and difficult 
for analyzing. Therefore below we show that there exists 
a parametrization where both the eigenvalues and eigen- 
vectors of 'J have a rather simple form but first we find 
convenient to illustrate our Theorem [5] 

Let Uj — (ui > j,U2j,U3j) t , and A 3 > 0, j = 1,2,3 
be eigenvectors and eigenvalues of \P respectively so 
that ^ = U^EoU^, where J/j = (ui,U2,U3) and E = 
diag(Ai, A2, A3). According to Theorem [2] the compo- 
nents of (for instance) vector 1*3, should have equal ab- 
solute values. First we note that since the state vectors 
are defined up to phase factors one can always choose 
M3.3 = l/\/3- Then using a similar re-scaling and unitary 
rotation of the vectors 'f? as discussed above, i.e. W — > 
U a W\ = U a ulW a E U a U 3 W a with U a = diag[exp(i/%)], 
j = 1,2,3, one can always guarantee the choice 1*1,3 = 
1*2,3 > 0. Since this transformation does not affect 113,3, 
from here it follows that 1*1,3 = u 2,3 = ^3,3 = 1/V3- 
Since if? is Hermitian, its eigenvectors Uj are orthogonal 
to each other. Therefore the vectors U\ and u 2 can be 
obtained by applying a general transformation (rotation) 
from the group SU2 to arbitrary two orthonormal vectors 
chosen to be orthogonal to the vector u 3 = (1,1, l)'/\/3- 
In particular, one can choose u% — (— 2, 1, l)*/\/6 and 
u% = (0, 1, — l)'/v2- In this way we obtain the following 
unitary matrix 



3 1 v za 



-K* 

3 xy 



V6 za 



V2 X V 



J-K* 4- -±-K* 



\ Ve za ,Fi x y 



V2 



— K* 

■s/6 x v 



— K* 

V2 za 



1 

V3 



(40) 

where K za — cos j + iK z sin ~, K xy = (K x + iK y ) sin ^ 
and we have used the usual parametrization for the SU2 
group [3] with < a < 2tt and K x , K y , K z being the 
coordinates of a real unit 3-dimensional vector, K x +K y + 
K z = 1, K XtViZ e E. Below we find more convenient to 
use two complex parameters K xy and K za subject to the 
normalization condition 



\K X 



\K. 



1 



(41) 



The normalization condition for the vectors if?, 
(^ 2 )j,j — 1, or explicitly 



A' 



5» fcj | 2 = l, A; = 1,2,..., TV, N = 3, (42) 



may be considered as a system of linear inhomogeneous 
equations with respect to eigenvalues A 2 . Note that be- 
cause of the unitarity of J/3 this system always has a 
solution Af = A| = A| = 1. This trivial solution is 
not suitable for our purposes and we need another one. 
This means that system (|42|) should be linearly depen- 
dent with a vanishing main determinant, det(\ukj | 2 ) =0. 
In this case one of the eigenvalues, say A3, is arbitrary 
while two others are linear functions of A 2 , . It is straight- 
forward to check that for Uk.j given in (|40p the determi- 
nant vanishes and from the system (|42[) we find 



\2 _ \2 _ 

A i — A 2 — 



\ 2 

A 3 



X 2 
A 3 



(43) 



Since we assume if! to be positive definite, this means 
that Ai = A2 and the unitary rotation of the vectors u\ 
and 1*2 considered above does not affect if? . Thus the 
vectors if? are parameterized by one parameter (A3) only, 



A A 1 A x 
* = I Ax .1,, Ax 

Ax Ax A 



(44) 



where 



A = ~(Aj +A2 + A3), Ax = -(X 3 -X 2 ). (45) 

From here we see that similarly to the previous section 
the coordinates of the vector ^2 are obtained from the 
coordinates of the vector ipx by a simple permutation 
(denote it P) and the coordinates of the vector V>3 are 
obtained from the coordinates of the vector ip2 by the 
same permutation P. Applying P to the vector ip 3 one 
obtains ipx- This means that P 3 — I3 (identity). Since 
such a permutation of coordinates is a unitary transfor- 
mation, the set (|4"4"|) is a particular case of symmetric 
states (for a definition see e.g. ||). From theorems [5] 
and [3] for the states (|4"4"|) we find the optimal mean ef- 



ficiency Pm = A3 for < A 3 < 1 and Pm = 



for 



1 < A3 < v3. Note that the vectors (|44| have equal inner 
products s = (ipj\ipk) = 3(^3 - A 2 ), j ^ k, -1/2 < s < 1 
and the optimal mean efficiency is obtained from Eq. (|50|) 
at N = 3. 

The condition \\ ^ A 2 is incompatible with the system 
(|42l when it is considered as a system of linear equations 
with respect to A 2 . Nevertheless, for \\ ^ A 2 , this system 
has a solution with respect to SU2 group parameters. In 
particular, after simple calculations one finds 



\K, 



A", 



(46) 



8 



For this parameter set matrix "J assumes the form 

A A 2 A* 



A\ A A 2 
A 2 A* A Q 



(47) 



(ii, x 2 , X3) = (0.67, 0.43, 0.99) and the optimal mean effi- 
ciency Pm — 0.535. The mean efficiency at the symmet- 
ric point t = (arccos^=,7r/4), P(t ) = A 2 = 1/2 < P M , 
is not optimal. 



where Aq is given in (|45|) and 
1 



An 



r (A 3 - e l 3 A 2 



(48) 



for the upper sign in (|46[) . ft follows from (|47|) that the 
vectors 'J have the same permutation symmetry as in the 
previous case. Hence, this is a set of symmetric states 
also. The same conclusion takes place for the lower sign 
in ((461). Note that A 2 defined by (J48]) cannot be real. 
Therefore the vectors (|4"4"]) are not a particular case of 
the set (|47[) . Note also that the parameters (Ai,A 2 ,As), 
defining the vectors (|4"7|) , should be such that A 2 + A| + 
A§ = 1. Otherwise the normalization condition of the 
vectors is violated. 

Thus we have proven that for N = 3 any set of states 
satisfying Theorem [5] is a set of symmetric states. In 
Section IIII CI we shall show that for TV > 3 this theorem 
describes a wider set of states than the set of symmetric 
states. 

Before illustrating Theorem [T] we will construct a gen- 
eral set of three vectors normalized to unity and described 
by a Hermitian positive definite 3x3 matrix ^ . This is 
possible if instead of U$ given in (|4"0|) we will use U3 so 
that now ^ — U3E0U3. The normalization condition for 
this set is given by the same Eq. (|4"2"|) where the matrix of 
the main determinant is replaced by its transposed form, 
i.e. Uk,j — > Uj,k' Since such a replacement does not affect 
the main determinant, the system remains to be overfull 
and one can express, for instance, Af and X 2 in terms of 
A| 



Al 



A 



i-3K 2 | 2 + a!(k 2 | 2 -k 2 | 2 ) 



A? 



Pl,2| 

A 2 . 



"2,2 



(49) 
(50) 



Thus, we have obtained a four-parameter set of matrices 
where three parameters come from SU 2 group and one 

parameter is A3 > 0. 

In particular, the choice K xy = and K za — exp(j7r/4) 

selects from this set a one parameter (A3) subset 



6 + A + -iV3A_ (1-«)A+ 
* -I iv^A- 3(2 + A+) (l+i)y/3X- I |. r ,li 
(l + i)A+ (l-i)y/3X- 2(3 + A+) 



2, A_ 



\2 

A 3 



where we have abbreviated A + = A 2 + A 3 
A 2 — A3. The eigenvalues of VP are Ai = 1, A 2 
and A3. 

As a numerical example we choose A| = 3/2. The 
optimal point on the sphere is found using Corollary [TJ 
(ti,t 2 ) = (0.919, 0.992), which corresponds to the weights 



N 



We start with an orthonormal set of 4-vectors where 
one of the vectors has equal coordinates 



Ui = 



( -1 — v/2 1 \ 
-72 1 1 
0-1^1 
V V2 1 1/ 



(52) 



In general one can apply a 8-parameter transformation 
from the group SU3 to the vectors (ui,u 2 ,U3) to get a 
general orthonormal set having a vector with equal coor- 
dinates. For the sake of simplicity we will use SU 2 group 
once again and apply it to the vectors u\ and u 2 in a sim- 
ilar way as it was described in the previous section. In 
such a way one obtains the following elements of matrix 



4*: 






01,1 


= 2A 3 - 


hX A + X 1 \K xy \ 2 + X 2 \K za \ 2 , 


02,2 


= A 4 + 


X 2 \iV2K xy + K za \ 2 + X x \iK xy - ^2K za \ 2 


03,3 


= "01,1 




04,4 


= "02,2 


+ 4V2(Ai - A 2 ) lm(K* y K za ) , 


01,2 


= ~i>l 


1 + 2(A 3 + A 4 ) - iy/2(\i - X 2 )K* y K za , 


01,3 


= "01,1 


-4A 3 , 


01,4 


= 4(A 3 


+ A 4 ) - 2^1,1 - ip li2 , 


02,3 


= 01,2 




02,4 


= 0i,i 


- 02,2 + 01,3 + 01,4 - 01,2 > 


03,4 


= 01,4 





(53) 

One can easily check that after this transformation sys- 
tem (|4"2"j) at N — 4 is overfull and has a solution X\ — 
A 2 = A3 = 5 (4— A|) useless for our purposes since it cor- 
responds to symmetric states. To get another solution we 
will assume X\ ^ A 2 for i ^ j and solve the system (|42)) 
with respect to SU 2 group parameters thus obtaining 



\K*v\ 2 



A? 



x 2 

A 3 



A? 



A 2 



\2 
A 3 



Ai - A 2 



K — h 




(54) 
(55) 



With this set of parameters the vectors W = \ft have 
unit norm for arbitrary Xj provided Ylj=i A 2 — 4. For 
the upper sign in (|55[) the basic elements of the matrix 
4^f given in (|53|) read 



Ai + A 4 + 2A 



\ 2 X 2 
., . A 2- A 3 

3 ^ Ai+A 2 
A 3 )(A 2 -A 3 ) 



01 

02,2 = 01,1 
04,4 = 02,2 , 

ipi <2 = 2(A 3 + A 4 ) - -0i,i + 



Ai+A 2 



(56) 



2iB x 
A1+A2 
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where 



V2 



(57) 



Thus we have obtained a 3-parameter (A2,A3,A4) set of 
the vectors 

Note that Eq. ([51)1 imposes restrictions on possible 
values of Af . In particular, the following inequalities: 



a 2 < x 2 2 < xi < X\ 



(58) 



should hold. Here we imposed the condition that Af is 
the lowest eigenvalue of 'i' 2 . Since under the last restric- 
tion the eigenvector of v? 2 corresponding to the lowest 
eigenvalue has equal coordinates, we can apply Theorem 
[2j According to this theorem the optimal mean efficiency 
coincides with Af , Pm = A|, provided the parameters are 
ordered according to ([55]) and A 2 , + A 2 . + X\ < 4. 

Finally, to show that the vectors from this set are nei- 
ther symmetric states nor their unitary rotated form we 
give matrix 'J' 2 




B 3 B 2 

B 2 B 3 + iB ± 
1 B 2 



(59) 



B* B 3 - iB 1 B* 1 



i(2-A?- 



A 2 — iBi) and 



where B\ in given in (|57|) , B 2 

B 3 = 1 

trix "J 2 of the form (|59p cannot correspond to symmetric 
states 



■A3. In the next section we will show that a ma- 



D. Symmetric states 

In this section we prove that the mean efficiency for 
the equiprobable so called symmetric states (see e.g. @ 
for some discussion and literature overview) is maximal 
at the symmetric point on the unit sphere. 

As shown in Q symmetric states ipk may be defined 
in terms of any orthonormal basis {e^}, k = 0, . . . , N — 1 
(ek\ej) = Skj in Ti as follows 



vn 



N-l 
J=0 



ij, k = 0,...,N -I, (60) 



where coefficients Ck may be arbitrary complex numbers 
provided 



JV-l 



£m 2 = i. 



(61) 



fc=0 



In what follows we choose a representation where 
(eo, . . • , e^v-i) = In- Then using the fact that the addi- 
tional phase factors in (f6"0|) are either iVth roots of unity 



or their integer powers and therefore they satisfy the fol- 
lowing identity 



N-l 



J2^ i(j - j ' )k/N = NS jj ,, j,/ = 0,1,. 



(62) 



k=Q 

one easily sees that 

^t = jvdia g (|co| 2 ,... ) |c w _ 1 | 2 ) (63) 

where as above "J = (?po, . . . , tpN-i)- By the same reason 
and in view of condition (|61[) matrix has the form 

M 
A 2 

A 3 

A 4 



= A = 



/ A 


A N -i 


A n 


A 1 


A () 


A n 


A 2 


A! 


A 


A 3 


A 2 


At 


\ A N -i 


An-2 


A N 



(64) 



where A Q — 1 and 

N-l 

Ak = h\ 2 e 2n%3k/N , k = l,...,N-l. (65) 
3=0 

A matrix of the form is called circulant |l5]. For 
any complex Ak it has eigenvalues JT5J F(e h+1 ), k = 
0, . . . , N — 1 where 



N-l 



(66) 



F(x) = £ A k x k , e = e 2 ^ N 

k=0 

and eigenvectors 
1 N 

.^^^V, k=l,...,N, (67) 



ffc- 



i=i 



Av k = F(e k+i )v k , fc = 0,...,iV- 1. 



(68) 



From l[67|) it follows that if v k j are the coordinates of the 
vectors Vk then \vkj\ = N~ x / 2 for k,j = 0, . . . ,N — 1. 
Using Theorem [2j the fact that matrices "J/'I't and 
have the same set of eigenvalues and formula (|63| we 
obtain for P M just the value reported by Chefles and 
Barnett H 



J P M = A^xmin(|c | 2 ,...,|c JV _ 1 | 2 ) 



(69) 



Note that the entries of matrix (|64|) are inner products 
of the vectors ipj, i.e. A — ((ipilt/jj)). Therefore they 
are independent on a unitary rotation of the set \F By 
this reason, for any set of symmetric states ^, matrix of 
their inner products has always form (|64|) . The opposite 
statement, evidently, is also true. If for a set of states VP 
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the matrix has the form (|64|) , then the states are 
either symmetric states or their unitary rotated form. 
This follows from the fact that one of the square roots of 
matrix (|64[) is a Hermitian matrix of symmetric states. If 
for a Hermitian matrix B 2 = {B 2 y a Hermitian matrix 
B = B< is given, then any other Hermitian square root of 
B 2 , B = W , B 2 = B 2 , is defined up to a unitary matrix 
V = (V 1 )- 1 , B = VB such that VB = BVl 

The entries of a matrix given in (|28p and (f!?5)) corre- 
spond to a set of symmetric states. This means that any 
set of states satisfying Theorem [3J i.e. a set, for which 
one of the eigenvalues of the matrix ^vff is TV — 1 fold 
degenerate, is either a set of symmetric states or its uni- 
tary rotated form. From this viewpoint Theorem [3] gives 
another recipe how one can identify such a set when it 
is presented by a non-Hcrmitian matrix. From the other 
hand matrix ([55)1 does not have this form meaning that 
the states $ obtained in Section UlI Cl are not symmetric. 

Another remarkable property of matrix (|64[) we would 
like to emphasize is the following. Assume we are given 
vectors collected as the columns to matrix i.e. we 
put \t = (-01, • • • , ^n) — A. In this case, evidently, 
ipj+l = Ptpj, j = 1, ...,N- 1, ipi = P^ N where P 
displaces every coordinate of a vector to the next posi- 
tion and places the last coordinate in place of the first 
one. Such an operator is unitary and, hence, according 
to a definition of symmetric states (see e.g. Q) the states 
(I64[) are symmetric. Using Theorem [5] we can obtain an- 
other (but equivalent to expression for the optimal 
mean efficiency for symmetric states. It may be useful 
when symmetric states are given in a form different from 
that used by Chefles and Barnett Q. 

This possibility is based on the property of matrix 
to be normal, i.e. AA' = A' A which can be checked by 
a direct calculation. Therefore matrices A, A* and A'' A 
may have the same set of eigenvectors given in ([57| . Ab- 
solute values of coordinates of any eigenvector coincide 
and we can apply Theorem [5] to find the optimal mean 
efficiency. According to Corollary [5] for the optimization 
procedure we can replace ^ by positive definite Hermi- 
tian matrix ^ = ft EqII where diagonal matrix Eq con- 
tains absolute values of matrix A eigenvalues as non-zero 
entries, E 0:k = |F(e fc+1 )|, k = 0, 1, TV- 1 where F{x) 
and e are given in . To apply Theorem [5] the vectors 
^ = A should have unit norm. This condition is satisfied 
if Y^j=o I A? 1 2 = 1- Now using Corollary [T] we conclude 
that Pm = min(|i?o,o| 2 , ■ ■ • , \Eq,n~i\ 2 )- Parameters Ak 
should be chosen such that E ,k ^ 0, k = 0, . . . , TV — 1. 
Otherwise the set "J becomes linearly dependent. 

IV. POVM AND NEUMARK'S EXTENSION 

In the previous section we formulated conditions to be 
imposed on the set of the vectors $ when, from the one 
hand, the mean efficiency takes its maximal value at the 
symmetric point and from the other hand there exists 
a Hermitian positive semidefmite matrix satisfying 



([5]). Another important property we established is that 
although Tl x is an TV x TV matrix its rank may vary from 1 
till TV — 1 so that if we associate a set of the vectors with 
the columns of this matrix, this set is necessarily linearly 
dependent with a linearly independent subset spanning 
the space of dimension N a varying from 1 till TV — 1. 
Below we will discuss the so called Neumark's extension 
which, in particular, consists in considering an additional 
space called ancilla space related just with the set of the 
vectors collected to Tl x . In particular, the dimension of 
the ancilla space is N a — ranklL,,. 

The notion of an ancilla first introduced in quantum 
information theory [1 61 ] permits one to reduce general- 
ized measurements to conventional von Neumann mea- 
surements in a higher dimensional space [Tig . Usually 
this extended space is considered as the direct product of 
spaces of the system to be measured with another (auxil- 
iary) known system called the ancilla (see e.g. [l7|). But 
as is stressed in [l8[ it may have a wider sense as any 
extra degrees of freedom related with the Neumark's ex- 
tension theorem. Since there are two ways to extend the 
initial Hilbert space, there are two different realizations 
of the extended Hilbert space: (i) the extended space is 
the tensor product of the initial (i.e. system) space and 
the ancilla space (TPE) and (ii) the extended space is the 
direct sum of the initial space and the ancilla space (DSE) 
[l8j . As is stressed in [l8[ DSE is much more economical 
from the experimental viewpoint since in this case the di- 
mension of the extended space is much less than the cor- 
responding dimension for TPE. In particular, the authors 
[18| claim that to unambiguously discriminate among TV 
given linearly independent non-orthogonal states by DSE 
an (TV — l)-dimensional ancilla space is sufficient whereas 
for TPE TV 2 additional dimensions are necessary, where 
TV is just the space dimension of the main system, i.e. 
the number of the given linearly independent vectors col- 
lected to <3/. From the other hand, as it is pointed out by 
Preskill [12J, the tensor product extension is much more 
transparent from physical viewpoint since additional di- 
mensions can always be interpreted as extra degrees of 
freedom of the extended system (e.g. internal degrees 
of freedom like spin or external degrees of freedom corre- 
sponding to an interaction of the system with a reservoir) . 

As is stressed in Section [Til (-W — l)-dimensional an- 
cilla space is the maximal space appearing as result of 
the optimization procedure and the actual ancilla space 
dimension varies from 1 till TV — 1. Thus for TV > 2 we 
give a stronger limit to the minimal ancilla s pac e dimen- 
sion for the DSE-case than the one found in [18| . To the 
best of the author's knowledge in the current literature 
there are no explicit constructions of POVMs accepting 
ancilla spaces of such low dimension as 1 for TV > 2 [22j • 
The next section is just devoted to fill in this gap. We 
will present a general construction of a simple form for 
a POVM with the ancilla space dimension in the range 
between 1 and TV. 

Another point we would like to emphasize is the fol- 
lowing. According to the necessary condition Q for the 
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optimization of the mean efficiency the highest singular 
number of matrix H x should be equal to 1. Just this 
property leads to a linear dependence between the vec- 
tors Tl x . If after the optimization one can sacrifice the 
very optimal POVM in favor of a bit less optimal POVM 
where all the vectors collected to matrix E x are scaled 
by a factor a bit less than 1 (real scaling factor should 
be chosen in agreement with how much from the optimal 
efficiency one is able to sacrifice), then the rank of H x 
becomes equal to JV and the ancilla space acquires the 
same dimension as the system space (N a = JV). In such 
a case, as we show below by an explicit construction of 
the corresponding Neumark's extension, one may enjoy 
an advantage of the direct sum extension of two copies 
of the same TV-dimensional space being presented in the 
form of the tensor product of the two-dimensional an- 
cilla space and iV-dimensional system space with a clear 
physical meaning of the ancilla space as an additional 
(e.g. internal) degree of freedom of an extended system. 
Thus, similar to DSE-case, we indicate on a stronger limit 
of the ancilla space dimension for the TPE-case which is 
2 instead of JV + 1 found in [18[ . The dimensionality of 
the enlarged (system+ancilla) space in this case is 2JV 
versus JV(JV + 1) indicated in [18| . 

Below we will consider only the vectors 3 X and to sim- 
plify notations will omit subscript x. 



there is no need to rotate all JV vectors by transformation 
V. It is sufficient to use an arbitrary N a x N a unitary 
rotation instead of V. Below we explicitly construct a 
matrix H having this property. 

For N a < N denote as U unitary matrix bringing HH^ 
to a diagonal form 

E&U = US d (73) 

where S d is a diagonal matrix, in which the highest 
(JV — N a ) fold degenerate eigenvalue, which is equal to 
1, occupies the last (JV — JV Q ) positions. Then from ([70]) 
one finds 

SHt = U(I N - S d )U* . (74) 

It is clear that by construction only the first N a columns 
of matrix U(In — Sd) 1 / 2 are nonzero. Therefore, the 
solution we need is obtained if 5 is composed of all non- 
zero columns of U(lN-Sd) 1 ^ 2 , which we denote Hi, times 
an arbitrary unitary N a x N a matrix V, i.e. 

H = 5iV, V~ 1 = V^. (75) 

Note that all N a columns of JV x N a matrix H (|75| are 
linearly independent by construction. 



A. POVM 

Let the vectors & G 7i, i = 1, . . . , JV, collected to the 
matrix H, be given. Denote as H matrix with unknown 
linearly independent vectors £j G H, j — 1, . . . , N a < N 
such that the identity decomposition 

SSt + HHt = I N (70) 

holds. Since by assumption the vectors H are linearly 
independent rankH = JV but rankH = N a < JV and HHt is 
positive definite while HH^ may be positive both definite 
and semidefinite. In both cases matrix (Jjy — HH''') 1 / 2 
is well defined and Hermitian. Therefore from (f70|) it 
follows 

E = (I N -~^) 1/2 V, V^ 1 — (71) 

where V is an JV x JV arbitrary unitary matrix. In par- 
ticular, for N a = JV one may choose V — (HHt) _1 / 2 H 
leading to 

3 = HHt)V2 ( ast)-Va H = [(~~^ - I N ]^~ . 

(72) 

Note that the JV x JV matrix H (|T2"|) contains JV linearly 
independent vectors as columns only if N a = N. If N a < 
JV among JV columns of this matrix only JV Q columns 
are linearly independent. Since we want this matrix to 
contain only linearly independent columns it should be 
rectangular of dimension JV x N a in this case. Moreover, 



B. Neumark's extension 

Neumark [l9| proved a very general statement concern- 
ing a representation of an additive operator- valued func- 
tion in a unitary space in terms of orthogonal spectral 
functions in higher spaces. In the context of our problem 
it, in particular, means that any non-orthogonal identity 
decomposition in the space TIn may be presented as a 
projection from a space of a higher dimension of an or- 
thogonal identity decomposition (see e.g. [l2j ). Although 
there exists a number of constructions extending a given 
non-orthogonal identity decomposition to an or thog onal 
one in a higher dimensional space (see e.g. [l2l Il8l |20|) 
they are based on either the tensor product extension 
[l2| or are not suitable for establishing an isomorphism 
between Hilbert spaces Hn (BTi-N and H.2®'Hn- There- 
fore in this section we give an explicit realization of the 
Neumark's theorem suitable for this purpose. 

Let us consider (JV + N a ) x (JV + N a ) matrix (JV a < JV) 

(^) 

As it is noted by Preskill [l2j the rows 

(Zk,l,. ■ ■ ,Zk,N,Vk,l, ■ ■ ■ ,Vk,N a ), k = N + 1, .. ., JV + N a 

of matrix (|76|) should be orthogonal to all its rows 
(0,i)--->Ci,JV,0,i)---,Cj,w„), 3 = 1,---,JV where z k ,j, 
Dk,ji £,k,j and £k,j are entries of blocks Z, Y, H and H 
of IA respectively. Below we find explicit expressions for 
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Z and Y in terms of the given 5 and 3 such that hi is 
unitary 

U^U = I N+Na (77) 

(or equivalently liW = Ijsr+N a )- 

Condition ([77]) is equivalent to the following set of 
equations for Z and Y 

5+5 + Z^Z = I N , (78) 

5+5 + y+y = , (79) 

5 t 5 + Z t y = 0. (80) 

Now we note that the condition hihi^ = lN+N a is an 
implication of (|77[) . This, in particular, means that the 
POVM condition ([70]) follows from ([?8 > (|80 ]) . Therefore 
we can replace Eq. ([79]) by Eq. ([70]) . From ([78]) we find 
matrix Z. For that we multiply (|70|) from the left by 
unitary matrix $+ and from the right by $ where $ = 
(SS+) _1 / 2 S which gives S+S+$+55+$ = In- Comparing 
this equation with ([78]) we conclude that Z^Z = $+55+$ 
and, hence, 

z = us+$ = vs+tss+j-vas , v- 1 = F+ (81) 

where V is iV a x iV arbitrary unitary matrix. Note that 
ZZ^ = V5+SV+ is N a x N a overlap matrix of the unitary 
rotated linearly independent by construction vectors £i, 
i = 1, . . . , N a collected as columns to 5 and, therefore, 
ZZ' is non-singular. Using this fact we find from (|80]) 

Y = -y(3 t 3)- 1 5 t (SS t ) 1 / 2 5 . (82) 

Note that for Y given in (|82[) condition ([79"]) is automati- 
cally satisfied which may be checked by a direct calcula- 
tion. 

It is important to note that if a non-singular 5 given 
in (J7JJ) is used in ([75]). in which case N a = N, Eqs. (1ST]) 
and (|82p assume simpler form 

Z = V$+5, (83) 

y = -y$+s. (84) 

These equations suggest the choice V = $ leading to the 
simplest expressions for Z and Y 

Z = E, y = -S, (85) 

and the following expression for hi [23| 

W = o- z (g)5 + cr :c (g)S, (86) 

where a x>z are the usual Pauli matrices. Since all Hilbcrt 
spaces of the same dimension are isomorphic to each 
other, 2A r -dimcnsional Hilbert space, which is the direct 
sum of two iV-dimensional spaces, Hn © Hn, is isomor- 
phic to 7i. 2 ® Hn- Relation ([8"6"]) may be used to map 
the operator hi presented by matrix ([76"]) in the space 
Hn © Hn to the space Hi ® Hn- 



V. CONCLUSION 

The main feature of our approach to optimization of 
the mean efficiency for discriminating among N given lin- 
early independent non-orthogonal states from methods 
usually used by previous authors consists in choosing a 
different form for the constraint to be imposed on opti- 
mization parameters to assure the existence of a POVM 
and, hence, the necessary probabilistic interpretation. 
The form of the constraint, we have used, permitted us 
to reduce the conditioned optimization problem for the 
mean efficiency to a problem of finding an unconditioned 
maximum of a function defined on an ./V-ellipsoid (Thco- 
rcm[T]) for the states given with different probabilities and 
on a unit ./V-sphere for equiprobable states ( Corollary [TJ. 
Using Theorem Q] we established the invariance of both 
the weight matrix and the optimal mean efficiency with 
respect to unitary rotations of the vector set. Therefore 
for any vector set $ the optimization procedure may be 
realized for its equivalent Hermitian form, $ = $+ with 
$ being a positive definite matrix. Using this fact for 
equiprobable states we succeeded to formulate a crite- 
rion when the optimal point is the symmetric point on 
the TV-sphere. This selects a set of matrices for which 
a numerical optimization becomes unnecessary. By the 
symmetric point on the sphere we mean a point with 
equal values of Cartesian coordinates when the sphere is 
centered at the origin. We have shown that for N = 2, 3 
only symmetric states satisfy this criterion but starting 
from N = A the indicated set becomes wider than the set 
of symmetric states. The whole family of states satisfying 
this criterion is still unspecified. 

We have also found a subset of N symmetric states 
which may be considered as the simplest generalization 
of two states since the expression for the optimal mean 
efficiency for two states is a particular case of a more 
general formula valid for the subset. The vectors from 
this subset are characterized by the following property. 
If $ is a matrix where the state vectors are collected as 
columns then has only two distinct eigenvalues one 
of which is N — 1 fold degenerate. 

Our approach is illustrated by examples with N = 
2,3,4 nonorthogonal states. 

Finally we presented our constructions of POVM and 
Neumark's extension. We indicated on an explicit pro- 
cedure how to construct a POVM for the case when the 
dimension of the ancilla space varies from 1 till N. As to 
Neumark's extension we presented a formula for the uni- 
tary matrix realizing the corresponding orthogonal iden- 
tity decomposition suitable for its use both in the Hilbert 
space Hn © Hn and Ti.2 ® Hn- 
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